Computer and Modernization (计算机与现代化) ›› 2012, Vol. 1 ›› Issue (11): 197-199. doi: 10.3969/j.issn.1006-2475.2012.11.048

• Network and Communication •

Design and Implementation of Full-text Search Engine Fast Building

LI Guo-fang   

  1. Guangzhou Health School, Guangzhou 510180, China
  • Received: 2012-07-24 Online: 2012-11-10 Published: 2012-11-10

Abstract: Search engines have made collecting information far more convenient, but developing a complete, mature search engine consumes substantial resources. This paper therefore focuses on how to quickly build a simple one. Building on search engine components and frameworks that open-source organizations have developed independently and made publicly available, it calls the Java APIs those components expose from within the JBuilder development platform to assemble a simple full-text search engine made up of three main parts: data crawling, index building, and search execution. The resulting engine crawls and saves web page documents, extracts their text, builds index documents and an index store, and supports basic keyword retrieval. The paper also describes the general process by which a search engine is implemented and run.
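
The three-part flow named in the abstract (data capture, index building, search execution) can be illustrated with a minimal sketch. The abstract does not name the open-source components it uses, so everything below is an assumption made for illustration: the class name MiniSearchEngine, its methods, and the sample pages are hypothetical, only the Java standard library is used, and the crawling step is simulated by passing in page content as strings rather than downloading it.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical stand-in for the pipeline described in the abstract.
// It does NOT use the open-source components the paper relies on.
class MiniSearchEngine {
    // Inverted index: term -> sorted set of page identifiers.
    private final Map<String, Set<String>> index = new TreeMap<>();

    // Text extraction: drop script/style blocks and tags, collapse whitespace.
    public static String extractText(String html) {
        String noScripts = html.replaceAll("(?is)<(script|style)[^>]*>.*?</\\1>", " ");
        String noTags = noScripts.replaceAll("(?s)<[^>]*>", " ");
        return noTags.replaceAll("\\s+", " ").trim();
    }

    // Index building: split the extracted text into lowercase terms
    // and record which page each term occurs in.
    public void addDocument(String url, String html) {
        String text = extractText(html).toLowerCase(Locale.ROOT);
        for (String term : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (!term.isEmpty()) {
                index.computeIfAbsent(term, k -> new TreeSet<>()).add(url);
            }
        }
    }

    // Search execution: return the pages that contain every query keyword.
    public List<String> search(String query) {
        Set<String> result = null;
        for (String term : query.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (term.isEmpty()) {
                continue;
            }
            Set<String> postings = index.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(postings);
            } else {
                result.retainAll(postings);
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }

    public static void main(String[] args) {
        MiniSearchEngine engine = new MiniSearchEngine();
        // Data capture is simulated: these strings stand in for crawled pages.
        engine.addDocument("page1.html",
                "<html><body><h1>Full-text search</h1><p>Index pages.</p></body></html>");
        engine.addDocument("page2.html",
                "<html><body><p>Crawl pages first.</p></body></html>");
        System.out.println(engine.search("pages"));
        System.out.println(engine.search("search index"));
    }
}
```

An in-memory inverted index keeps the sketch short. A real deployment of the kind the abstract describes would presumably delegate crawling, HTML parsing, and persistent indexing to dedicated components rather than to regular expressions and a map.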

Key words: information, search engine, full-text search
